A quick tutorial on PDF Transformation 



Written by: Chieh Wu 
This tutorial assumes: 

• The reader already understand the concept of PDF, CDF, random vari- 
ables. 

This tutorial will: 

• Show you how to transform a PDF function into another one 

• How to generate random variables using the uniform distribution. 

Let us start with a random variable X. This random variable can be anything. For 
the sake of our example let us use a uniform distribution. Let's say we are waiting for 
the arrival of a bus. The bus can show up between 6:00 and 6:10. The distribution for 
the bus to show up is therefore uniform between 10 minutes with the probability of 1/10. 
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Now let's assume we have a PDF of Y such that Y = 2X. To give you a more concrete idea 
of what the equation Y=2X means, it is now basically saying that the bus can show up any- 
where between 6:00 to 6:20. Before the possible outcomes of X were between 0 to 10 minutes. 
Now the possible outcomes of Y is two times the outcome of X. We therefore have 20 minutes. 
The goal of this paper is to show how we can find the PDF of Y by knowing X and the relation- 
ship between X and Y. 

Of course, this is an oversimplified example since most probability students can guess that a 
uniform distribution would now have a probability of 2Q mi 1 nutes • By going through this exercise, 
the goal is to understand the mechanism of how we can transform any continuous distribution. 



Let us once again start with a distribution of X. The distribution of X is related to a distri- 
bution of Y such that Y = f(X). The distribution of Y is also related to X such that 

x=r i (Y). 

In our case 

y = f(x) = 2x equation: 1 

x = f~ 1 (y) = \y equation: 2 

Before I go on further, let me explain the notation of X and x. The capital X is the random 
variable while x is the actual number within the random variable. So in our case, X is the 
random variable from 0 to 10 while x may be the number 5 within 10 minutes. The same idea 
goes with Y and y. The capital Y is the distribution while y is the actual variable. 

The cdf of X and Y are therefore: 

F x (x)=p(X<x) equation: 3 

F y (y)=p(Y<y) equation: 4 

If we plug x from equation 2 into x in equation 3 we would get: 

F x (x)=p(f- 1 (y) <x)= p(\y<x)=p(y<2x)=F y {2x) 
This is a very important transition. The key we want to show here is that 

F x (x) = F y (2x) F x (\y) = F y (y) F^/" 1 ^)) = F y (y) 

In english, this basically means that the CDF of X is related to the CDF of Y. In our 
example we know that the probability of x is — — 1 , . To find the CDF we would take the 
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integral of the pdf. 

Now if we want to get the CDF of y we use the relationship 
F x (f-\y))=F y (y) => ±(L y ) = ± y 

Now if we want to convert from CDF back to PDF we would take the derivative of the 
CDF with respect to y. 

From this let's now go over the step by step instruction on transformation. 

1. Take the integral of the PDF you are transforming. 

2. Plug the inverse relationship between X and Y into the CDF of X 

3. Take the derivative of the CDF of X 

Part 2 

How do you generate different random variables using a 0 to 1 uniform distribution 
From the equation above 



F x (x)=F y (f- 1 (y)) 
Since we are using the uniform distribution 

F x (x) = x 
This means that we really have 

x = F y (f- 1 (y)) 

For example, let's say we want to generate a poisson random variable using a uniform distri- 
bution. 

The formula for the poisson distribution is: 
P( n ) = e - A 7TT 

With A of 4 we would have a distribution that looks like this 
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And a CDF that looks like this 
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If you want to generate a poisson distribution, all you have to do is generate a random uni- 
form variable from 0 to 1. This will represent the y axis on the CDF graph above. Once you 
have the y axis, you simply match it up with the closest corresponding x axis. So let's say if 
your uniform generator outputs 0.6, in the code you would use a look up table and find that you 
have really generated the number 4. 

Let's take a look at a gaussian distribution as well. 




If we let the mean to be zero and variance equal to 1, we would have a graph like this. 
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The CDF of the gaussian 




We would generate the gaussian distribution the same way. We first generate a random uni- 
form number from zero to one. Then we'll find the the corresponding x axis on the CDF graph. 



This is hopefully a help for some people. It is normally a pain to generate gaussian distribu- 
tion in c or many other programming languages. I think I have messed up something in my 
derivation at the first section, I am probably not using the most perfect notation, if you noticed 
the mistake, please let me know. 



I can be contacted at juliusctw@yahoo.com 

You can also find my other writings and work at http://www.ourmedia.org/user/4887 



